GMM-Based Identification of Indonesian Speech
نویسندگان
چکیده
This paper reports the performance of identification of Indonesian speech within a ten-language corpus: English, German, Hungarian, Indonesian, Italian, Korean, Mandarin, Polish, Portuguese, and Swedish. The tasks are performed by implementing Gaussian Mixture Model (GMM) on MelFrequency Cepstral Coefficients (MFCCs). Two types of experiments that have been undertaken: pair-wise and tenlanguage experiments. In the pair-wise experiments, the performance of the model in identifying Indonesian within every language pair is evaluated. The experiments show that Indonesian is best distinguished from English, Korean, and Portuguese with 90.5% of accuracy. In ten-language experiments, the highest accuracy of identifying Indonesian is 85.71%.
منابع مشابه
مقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملUsing Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...
متن کاملDiscriminative Training of GMM for Language Identificatio..
In this paper, a discriminative training procedure for a Gaussian Mixture Model (GMM) language identification system is described. The proposal is based on the Generalized Probabilistic Descent (GPD) algorithm and Minimum Classification Error Rates formulated to estimate the GMM parameters. The evaluation is conducted using the OGI multi-language telephone speech corpus. The experimental result...
متن کاملThe Approach of Speaker Diarization by Gaussian Mixture Model (GMM)
Speaker identification is an important activity in the process of speaker diarization. We need to model the speaker by Gaussian mixture model (GMM) for speaker identification purpose. Large GMM is called as a Universal Background Model (UBM) which is adapted into each speaker model for speaker identification purpose. This paper focuses on speech clustering for speaker diarization. The speaker d...
متن کاملPerformance Analysis of Speaker Identification System Using GMM with VQ
Personal identity identification is an important requirement for controlling access to protected resources. Biometric identification by using certain features of a person is a more secured solution for security identification. Advances in speech processing technology and digital signal processors have made possible the design of high-performance and practical speaker recognition systems. A more...
متن کامل